PASSING TO THE LIMIT IN A WASSERSTEIN GRADIENT FLOW: FROM 

DIFFUSION TO REACTION 
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Abstract. We study a singular-limit problem arising in the modelling of chemical reactions. 
At finite e > 0, the system is described by a Fokker-Planck convection-diffusion equation with 
a double- well convection potential. This potential is scaled by and in the limit £ — > 0, 
the solution concentrates onto the two wells, resulting into a limiting system that is a pair of 
ordinary differential equations for the density at the two wells. 

This convergence has been proved in Peletier, Savare, and Veneroni, SIAM Journal on Mathe- 
matical Analysis, 42(4):1805-1825, 2010, using the linear structure of the equation. In this paper 
we re-prove the result by using solely the Wasserstein gradient-flow structure of the system. In 
particular we make no use of the linearity, nor of the fact that it is a second-order system. 

The first key step in this approach is a refornmlation of the equation as the minimization 
of an action functional that captures the property of being a curve of maximal slope in an 
integrated form. The second important step is a rescaling of space. Using only the Wasserstein 
gradient-flow structure, we prove that the sequence of rescaled solutions is pre-compact in an 
appropriate topology. We then prove a Gamma-convergence result for the functional in this 
topology, and we identify the limiting functional and the differential equation that it represents. 
A consequence of these results is that solutions of the e-problem converge to a solution of the 
limiting problem. 
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1. Introduction 

In a seminal paper in 1940, Kramers introduced a model of chemical reactions in which the 
system is represented by a Brownian particle in a potential energy landscape [Kra40]. In this 
model the wells of the potential energy correspond to stable states of the system, and a reaction 
event is the passage of the particle from one well to another. By analyzing the probability of such 
a reaction event in terms of system parameters, Kramers was able to improve existing formulas 
for the macroscopically observed reaction rate. 

Although Kramers does not state it in these terms, the central result in [Kra40] is a convergence 
result in the limit of large activation energy. In [PSVIO] we provided a first rigorous proof of this 
result in the case of Brownian particles without inertia. The present paper can be considered a 
sequel to [PSVIO], in which we address a question that was left unanswered in [PSVIO]. 

The issue hinges on the fact that the system of [PSVIO] is a gradient flow of a free-energy 
fimctional with respect to the Wasserstein metric. The proof of the main result made no use of 
this structure, however, and this led us to ask. Can we prove the same result using the structure 
of the Wasserstein gradient flow? 

This question is interesting for a number of reasons. The first is that the Wasserstein gradient 
flow is a natural and physically meaningful structure for this problem — we explain in Section 7 
what we mean by this. It can actually be argued that it is more natural than the linear structure 
that we used in the proof in [PSVIO] , and therefore it is also natural to ask whether this structure 
can be used. 

The second reason is that the Wasserstein gradient-flow structure is known to arise in an 
impressively wide range of models and systems (e.g. [CG04, AGS05, Sav07, BCC08, MMS09, 
GST09, GiglO, CDF+], just to name a few), and therefore any method that uses only the properties 
of this structure has the potential of application to a wide range of problems. Consequently, our 
approach here is to limit our use of information to those properties that follow directly from the 
gradient-flow structure. 

As a third reason, this work flts into a general endeavour to use gradient-flow structures to 
pass to the hmit in nonlinear time-evolving systems (see e.g. [SS04, Ste08, MRS08, MSll, ASZ09, 
Ser09]). The inherent convexity and lower-semicontinuity properties of this type of formulation 
provide handles for such limit passages that are similar to the well-known results for elliptic 
systems — as we show below. 

1.1. Kramers' problem. The motion of a Brownian particle in a one-dimensional potential 
landscape is described by the initial boundary-value problem (often called a Fokker-Planck or 
Smoluchovski equation [Ris84, p. 8]) 

dtPe = Ted^{diPe + ^Ped^H^ t > 0, ^ € E := [-1, 1], (1.1a) 

diPe + -^Ped^H = 0, i > 0, ^ = ±1. (1.1b) 

The unknown function is a time-dependent measure in 9i{{E) (the space of finite, nonnegative, 
Borel measures on the closed interval S = [—1, 1]), and this equation is to be interpreted in an 
appropriate weak form. 

In this paper we take the potential energy to be a double- well potential, with wells in ^ = ±1, 
and we follow the choice of [PSVIO] to truncate the domain at the wells, i.e. we take E! = [—1, 1] 
as the spatial domain (see Figure 1). For deflniteness we assume that H is smooth, even, maximal 
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Figure 1. A typical function H 

at with -ff(O) = 1, and minimal at ±1 with 77 (±1) = 0. Each of these assumptions can be 
relaxed, but that is not the purpose of this paper. 

In (1.1) two important constants appear. The potential H is scaled by 1/e, which creates 
the situation of large activation energy: the energy barrier separating the two wells is large in 
the limit e — >■ 0. As a consequence, the rate at which a particle passes from one well to the 
other is exponentially small as £ — 0; with the coefficient r^, which is defined in (2.2) below 
and which tends to infinity as e — >■ 0, we adapt the time scale to make the rate of transition 
asymptotically 0(1). 

The asymptotically large 'hump' of the potential H/e causes any solution of (1.1) to become 
singular in the limit e ^ 0. This is well illustrated by the unique stationary solution of unit mass, 



^£ e I L I [-1,1], 



(1.2) 



where is a normalization constant and is the one-dimensional Lebesgue measure (see Fig- 
ure 2). Since -ff(^) > for all ^ ^ ±1, the measure 7^ becomes strongly concentrated at the wells 
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Figure 2. The measure 7^, illustrated by plotting its Lebesgue density 



^ = ±1 as £ -)■ 0: 



7e 



70 



-0-1 + -oi. 



(1.3) 



2 2 

In [PSVIO] we proved a number of results. The first is that the sequence converges^, in the 
sense of measures, to a limit measure po, whose support is restricted to the two points ^ = ±1: 



The densities 



Pe Po 

[0, T] — > M of this limit measure po satisfy the limit equation 
dtu^ = k{u^ - ?io ) 
dtu^ = fc(wo ~ "0 )• 
where the rate constant k is given in terms of the potential function H by 

k=-^\H''iO)\H"{l). 



(1.4a) 
(1.4b) 

(1.5) 



This limit system corresponds to the natural modelling of the monomolecular reaction A ^ B at 
the continuum level. 



^The result of [PSVIO] uses a slightly different definition of t^, which is asympotically equivalent to the one this 
paper, (2.2). 
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A second result states a stronger form of convergence, and also highlights the role of the 
density of the measure with respect to 7^, i.e. 



die 



which satisfies the dual equation 



(1.6) 



(1.7) 



Figure 3 illustrates the relationship between and u^- As it turns out, is much better behaved 

OWe) 
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Figure 3. A comparison of and the density = dpe/dje- 

than /?£ in the limit e ^ 0: if the initial datum for is bounded above and below, then the same 
holds for Ug by the comparison principle, since constants are solutions of (1.7). In addition, 
becomes locally constant away from ^ = (see part 3 of Theorems 3.1 and 3.2 below). This is 
reflected in a stronger form of convergence for u^, proved in [PSVIO], which implies in particular 
that nonlinear functions of also converge. 

The aim of this paper is to derive similar convergence statements by different methods, specifi- 
cally, by using only the structure of the Wasserstein gradient flow. Before describing this structure 
for the specific case of (1.1), we first recall the general structure of a gradient flow in a smooth 
and finite-dimensional setting. 

1.2. Gradient flows in a smooth Riemannian setting. Let us consider a smooth d-dimensional 
Riemannian manifold Z, a energy functional £ : Z ^ R, and a quadratic dissipation potential 
ip induced by the Riemannian metric on Z. In local coordinates, we can identify Z (and the 
tangent space Tz{Z) at each point z G Z) with K'' endowed with a smooth Riemannian tensor 
G{z) :M'^(= T^(^)) ^K''(= T*(Z)) in the form V(i; ^) = ^{G{z)z,z). 
The gradient flow of £ in .E is then given in the form 

z{t) =v{t) €T^(^t)iZ), where u(t) = -Vg^ (z(i)) or G{z{t))v{t) = -T)£{z(t)). (1.8) 

Here and elsewhere in this paper we use overdots for time differentiation and D for the Frechet 
derivative of a function (an element of T*(Z) in the Riemannian setting). The gradient Vg^ 
is defined as usual via the metric as 2: M- G{z)~^D£{z). It will sometimes be easier to use 
the dual dissipation potential tjj* given via the Lcgrendrc transform with respect to z, namely 
tp*{r];z) = ^{ri,G{z)^^r]). Then the gradient flow (1.8) takes the form 

z = 'DiP*{-B£{z);z). (1.9) 

Here and below the derivatives and Dip* are only taken with respect to the flrst variable. 

Solutions of (1.9) in a time interval (a, 6) can be characterized as minimizers of the action 
functional 

A{z; a, h) := ^ V(i + Vcf (^); z)dt=^J^ {G{z){z + Vg£{z)), z + Vg£{z)) dt, 
defined on curves with values in Z. Expanding the integrand and observing that 

{G{z)z,VG£{z)) = {D£{z),z) = p{z), 



(1.10) 
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we see that A has the structure 

A{z\ a, h) = £{z{h)) - £{z{a)) + J{z\ a, b), 



J{z- a, 6) / ij{z{ty,z{t)) + r {-J^£{z{t)); z{t)) 



At. 



(1.11) 



Note that for every curve z we have A{z; o, 6) > 0, while A{z; a, 6) = if and only if z satisfies (1.8). 

1.3. Gradient flows in a metric setting. The functionals J and A can be generalized to 
infinite-dimensional and non-smooth settings given by a space Z with a lower semicontinuous 
(pseudo-, i.e. possibly taking the value +oo) distance 6 : Z x Z ^ +oo]. In such a space both 
tangent spaces and derivatives might not exist. Instead one can turn to two metric concepts, the 
metric slope \d£\ of the functional £ and the metric velocity \z\ of a curve. The metric slope 
generalizes {2-^* {—Tf£[z); z)Y/'^ and is defined by 

\d£\{z) :=limsup (1.12) 

Instead of defining a dissipation potential V' on the tangent space of an arbitrary point of Z, one 
considers the class AC{a, b; {Z, d)) of absolutely continuous curves (with respect to the distance d) 
and their metric velocity 

\z\it) := hm + iiz^ACia,b;{Z,d)), (1.13) 

h->o \n\ 

which exists for a.e. t G (a, 6) [AGS05, Th. 2.1.2]. 

Using these concepts, the natural generalization of J in (1.11) is 

J{z;a,b):= [^Iz^t) + ^\d£\\z{t))\dt if z e AC{a,b; {Z,d)), (1.14) 

trivially extended by +00 if z is not absolutely continuous. Assuming that the slope is a strong 
upper gradient for £ [AGS05, Ch. 2], it is not difiicult to prove that 

J{z; a, b) > £{z{a)) - £{z{b)) for every curve z G C{[a, b];Z) with £{z{a)) < +00. (1.15) 

Comparing with the classical case outlined in Section 1.2 we deduce the following common struc- 
ture: 

Definition 1.1. Let Z be a topological space, £ : Z (—00, -l-oo] be a Junctional, and let J{-;a, b) 
be a nonnegative (extended) real functional defined on C([0,T];Z) for all < a < b < T, and 
satisfying 

£{z{b)) + J{z]a,b)>£{z{a)) for every z € C{[a,b]; Z). (1.16) 

Writing 

A{z) £iz{T)) - f (z(0)) + J{z; 0, T), 

we define a curve z G C{[0, T];Z) to be a solution of the gradient flow system {Z, £, J) if£{z{0)) < 
00 and A{z) = 0. 

This formulation of a gradient flow, in terms of the functional A, will be the basis for the rest 
of this paper. It clearly contains the classical case of a gradient system {Z, £, d), for which J' can 
be defined via (1.12)-(1.14), and the metric-space outlined above, and it is sufficiently general to 
contain also the structure of the limiting problem (see Section 1.7). 

1.4. A first gradient-flow structure for (1.1): the Hilbertian approach of [PSVIO]. We 
now turn to the specific case of this paper, equation (1.1). It is well known [Bre73] that equa- 
tion (1.1) in the density formulation (1.7) is the gradient fiow of the Dirichlet form 

^""M := I /' d7e, (1.17) 
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in the weighted Hilbert space = 7e). In this approach the quadratic dissipation potential 
(which is also the squared metric velocity) of a curve u is 

Vr(«;u) = I ff^le = ImUe;,^), (1-18) 

and does not depend on u, so that the resulting space has a flat Hilbertian geometry. 
The limit ODE (1.4) has a similar linear structure, given by 

4- = {{-1,1}; jo) = {(4,«o)eM^4=«o(±l)}, 



{-1,1} 



The rigorous transition from (1.18) to (1.19) is established in [PSVIO] in a more general setting 
where diffusion in physical space is allowed as well. The analysis in [PSVIO] depends in a crucial 
way on the linearity of the problem. 

1.5. An alternative gradient-flow structure for (1.1): the Wasserstein approach of 

[JK097]. As was discovered in the seminal work by Otto and co-workers [JK097, OttOl], equa- 
tion (1.1) has another relevant gradient structure. It relies on the interpretation of p as a mass 
distribution which is transported such as to reduce the free energy. 
In order to describe this point of view, we introduce 

:=5W'(S), £:f-(p):=^tilog«d7.-p(S)logp(S), where «:= (1.20) 

with the convention that f ''°°(p) = +oo if p is not absolutely continuous with respect to jg. The 
space Z™'^^ is endowed with the usual weak-* convergence of measures (i.e. convergence in duality 
with continuous functions) and can be metrized by the L'^ - Wasserstein distance d\y- 

This distance dw admits two nice characterizations: the first one involves optimal transport 
(see e.g. [Vil03, AGS05] ) , while the second one is related to the dynamical interpretation discovered 
by Benamou and Brcnier [BBOO] and is well adapted to the gradient-flow setting. 

In the latter point of view, we introduce the class CE(a, 6; S) (Continuity Equation) given by 
couples p e C{[a,b];Z"""'^) and ly e M{{a,b) x S;M) such that 

dtp + d^v = in the sense of distributions in ay {{a, b) x R). 

Here we trivially extend p by zero outside of S. Often v = pv for some Borel velocity field 
u : (a, 6) X S — >• K, in which case the conditions above reduce to 

(■b f 

\v{t, 01 p{t, dC) dt < +00 and 



(1.21) 

dtp + d^{pv) = in the sense of distributions in 'D'{{a, b) x M). 

For those couples (p, v) G CE(a, b; S) such that there exists such a velocity field v with u = pv, 
the distance dw can be defined in terms of v, by 

d^{po,Pi)=mmy^Jjv{t,^)\''p{t,dOdt:{p,pv) e CE(0, l;S),p(0) = po,p(l) = Pij, (1.22) 

which illustrates how we can interpret v as the 'Wasserstein velocity' of the curve p. Note how 
finiteness of dw requires that z/ <C p and du/dp G L'^{p), implying that CE(a, b; S) is a larger space 
than AC{\a,b];dw)', indeed, our choice to work with CE(a, 6; S) stems from the fact that in the 
limit e = the objects will still be elements of CE(a, b; S), but no longer of AC{[a, b];dw)- 
Recalling (1.13), it is natural to introduce the dissipation potential 

^f-^^{v;p):=^J^v{0'p{dO, iovpe!M{E), v&L'{E;p). (1.23) 

This expression suggests to interpret L^(S; p{t, •)) as the 'Wasserstein tangent space' at the mea- 
sure p{t, •), and in [AGS05, Ch. 8] this suggestion is made rigorous. 
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The corresponding (squared) slope of S^'''^^ [AGS05, §10.4.4] defined by (1.12) is the Fisher 
information 

\dS^"'%p)\'^ := I dp = 4^^ \d^V^\^ d7„ if w = ^ with e W^'^{-1, 1). 

This corresponds to the choice of the dual dissipation potential 

and of the 'Wasserstein gradient' Vw^l'^'^^ of the entropy given (at least formally) by 

Vw£l'^%p):=^=d^logu, u=^. 

U ^ die 

This construction is equivalent to (1.1): in fact, at least for smooth densities, 

de.P + \p^i^ = Pdi log (^) 
so that (1.1) has the gradient flow structure (1.8) in the Wasserstein sense: 

dtp + d^{pv)=Q, v = -T,Vw£'r ip) > 
Motivated by these remarks, we introduce the functional J^^^, 



J^^^^{p;a,b) :-- 



if {p,pv) e CE(a,6;E!) and p = uje, (1.24) 



and the corresponding Wasserstein action functional 

^wass(p. ^) £f-(p(a)) - £f '=°(p(6)) + J^^^%p; a, 6), (1.25) 

which satisfies the admissibility condition (1.16). In analogy to Definition 1.1, a Wasserstein 
solution p of (1.1) in the time interval [0, T] is a curve in fW(S) with S^'^^'^ {p{0)) < +oo and 

A(p;0,r) = 0. 

1.6. Our main results. In this work we prove various results on the connection between the 
Wasserstein gradient structure (^meas^ £;free^ j-Wass-j g^^^j gradient structure {Z^'^^^ , , Jq) for 
the limit system (1.4). As described above, the motivating question is whether we may pass to the 
limit in the gradient-flow equation Ji^'^^'^{p^) = 0. This question falls apart into two sub-questions: 

(1) Compactness: Do solutions of J^^^^{pg\ 0, T) = with uniformly bounded initial entropy 
£^J'^'^{p^{Q)) have beneflcial compactness properties, allowing us to extract a subsequence 
that converges in a suitable topology, say cr? 

(2) Liminf inequality: Is there a limit functional Jo such that 

Pe ^ Po =^ liminf ^^^(p,; a, b) > Jo(Po; a, 6)? 

And if so, does it satisfy the admissibility condition (1.16), i.e. 

^o^'iPic^)) - ^o^'ipm + Mp; a,b)>0 
for every 0<a<&<TandpG C{[a, b];Z^''^'') with S^'^^ipia)) < +oo7 

Our answer to these questions is indeed affirmative. Question 1 is answered by Theorem 3.2, 
which establishes that any sequence pe such that f|™''(pe(0)) and J'^'^^{pe;Q,T) are bounded, is 
compact in several topologies. These boundedness assumptions are natural, and only use infor- 
mation associated with the gradient-flow structure. 

Question 2 is addressed by Theorems 4.1 and 6.1, which characterize the limit of the functional 
j-Wassj-.. jjj^ terms of Gamma-convergence. If we denote by a the topology mentioned above, 
then this convergence is characterized by the existence of functional j7o and Sq'^^ satisfying the 
two properties 
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(1) Lower bound: for each family of curves Po with sup^ £^^^'^{ps{a)) < +00, we have 



Jo(po; a, b) < Uminf J^^^^p,; a, b) and £'^''%po{b)) < liminf 

(2) Recovery sequence: for each po € C{[a,b]] Z™'^'^^) with fo™''(po(a)) < +00, J7o(po;o, < 
+00 there exists a sequence G C{[a, b];Z'^^'^^) such that po and 

Jo(po; a, b) - lim a, &), St^Poib)) - lim 

The limit structure {Z™°^^, Sq^'' , Jo) consists of measures p that are absolutely continuous with 
respect to 70 and thus supported in { — 1,1}: 

p = -u~6-i H — u'^Si for some > 0. 

The space Z™^^ and the energy £q^^ are natural limits of the corresponding objects as £ ^ 0: 

Z^^'^ = {p e : supp(p) C {-1, 1}} C Z"'^'^, 

^o^^ip) = ^ ^log (^^) d7o -mlogm = ^(^M+logM+ +M"logu"^ - mlogm (-2 26) 

where = -^(±1), m = p{E) = ^-{u'^ + u~). 
d7o 2 

This limit energy Sq^'^ is the Gamma-limit of £'f''<=° [ASZ09]. However, the limit functional 
j7o(-; a, does not have the same duality structure as (1.11), and we discuss this next. 

1.7. The structure of JTq. In fact, since the limit problem is characterized by measures p{t) 
concentrated at ^ = ±1, no effective mass transport is possible between ^ = — 1 and ^ = 1. 
Assume for instance the case when p is sufficiently smooth, i.e. p{t) = ^ '^u^{t)S±i = wjo for a 
couple e C^((a, &);M). Then the distributional time derivative of p is dtp = \ X^w^<5±i and 
any signed measure v supported in S x [a, b] and solving the continuity equation 

dtp + d^v = Q in ®'(R X (a,6)) (1.27) 

cannot be absolutely continuous with respect to p and therefore cannot admit the decomposition 

u = pv for some v € L^(— 1, 1; p) (except for the trivial case = 0). 

Recalling that the total mass m = |(u~ + W^) is conserved and therefore u~ = —u~^, equa- 
tion (1.27) has the unique solution 

iy = wL\_i^i^^(^^b), w{i,t) = ^u+{t) for^e (-1,1), i e (a,&), (1.28) 

trivially extended to outside [—1,1]. 

As we show below, j7o(p; a, b) has the form 

Jo{p;a,b):= [ M{w{t),u^{t)) dt (1.29) 



if p(i) = ^ ^u='=(i)(5±i, u"^ G AC{a,b;R), with w = =-^u+, 
where the function M : M x [0, 00)^ [0, -|-oo] is given by 

M(u;,u±) := inf I / -^-h"^ ds : u € k), u(±k) = 1 , k := |. (1.30) 

I 2^(5) 2u{s) J k 

This functional Jo satisfies the admissibility criterion (1.16). Indeed, along any admissible 
curve p(t), 

^^£^'^%p{t)) = Ai^«±(i)log«±(t) = i^(logu±(i) + l)«±(i) 
= ^[logu+{t)-logu-{t)]u+{t), 
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SO that the admissibihty condition (1.16) is equivalent to 

(logu'*' — logu~)w < M{w, u^) for every w e M, > 0. (1-31) 

In Theorem 5.2 we prove this inequaUty, implying that the limiting action Ao, 

Aoip) := S^J^'ipiT)) - £:f^<^(p(0)) + Mp), (1.32) 

satisfies Ao{p) > for all p. 

It is now natural to ask which curves p satisfy the equation AQ{p;a,b) = 0. This equation 
implies equality in (1.31), which suggests defining the 'contact set' [MRS09] 

C { iu'^,w) I M(u;,u=^) + (logu+ - logu~)w = }. 

A consequence of a second inequality proved in Theorem 5.2 is 

k 

{u^,w)€C <S=^ ti; = -(u+ - u~). 
This implies that any p satisfying Ao{p; a,b) = also solves the limiting equation (1.4). 

1.8. Recovering a gradient flow. Finally, one might ask whether it is possible to find a 'true' 

gradient structure, i.e. an alternative functional that does have the dual structure as in (1.11) 
or (1.14). For this we need to find a dissipation potential tpo{w;u'^) such that the associated 
contact set is equal to C, i.e. such that 

Using the two-sided estimate of Theorem 5.2 for M we find that a natural choice for ipo is 

± 21ogu+-logw- 2 

ipo{w;u ) = - w 

k u+ — u 

which gives the desired result (1.4). 

1.9. The variational approach: basic tools and main ideas. As we mentioned before, in 
the present paper we adopted a metric-variational approach to extract crucial information from 
the particular structure of the equation (1.1). This point of view has become quite popular and 
arises from 

• the combination of general metric concepts (briefly recalled in Section 1.3), 

• basic measure-theoretic tools, optimal transportation, and entropy-dissipation techniques 
(a la Otto [OttOl], see Section 1.5), 

• a careful use of the continuity equation (1.21) and of the Benamou-Brenier dynamical 
point of view [BBOO], 

• standard F-convergence methods. 

On the other hand, the problem exhibits many non-standard features, which we have addressed 
with new ideas and techniques, which could hopefully be useful in other situations. 

• One of the main points here is that the limit procedure mixes in an unusual way the 
two contributions to Jg, namely that due to time change {ip{z) in the notation of Sec- 
tion 1.2) and that due to the entropy slope {'ip*{—D£)). Therefore one cannot canonically 
separate these two contributions in the structure of JTq. This fact has another interesting 
consequence: in the present setting it is not possible to investigate separately the limit 
behaviour of the distance and of the functional using F-convergence tools (as in the well- 
behaved gradient flows considered by [Sav07, ASZ09, Ser09, PSVIO]). Conversely, the 
geometry perturbed by the sublevels and by the slopes of the varying entropy functionals 
£;free i^fjuces a new kind of evolution in the limit, which can solely be captured by consid- 
ering the asymptotics of the whole space-time functionals J^. 

This singular behaviour motivated our general Definition 1.1 of a gradient flow system 
{A, £, J) and the idea to focus on basic structural dissipation inequalities along solutions 
of the continuity equation which could be preserved in the limit. 
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• A second crucial point is the rescaling strategy (Section 2) which allows us to resolve 

the singular behaviour of the functionals along sharp transitions. Since the continuity 
equation has good invariance properties, we can combine the information coming both 
from the original and from the rescaled formulation to construct the limit. 

• As a byproduct, we recover a (quite surprising, at first sight) common structure uniting 
(1.1) and (1.4) thanks to this measure-theoretic interpretation: this is not completely 
trivial, if one takes into account that the limit problem is a system of ordinary differential 
equations. 

• This fact is strongly related to the new variational formulation of reaction-diffusion sys- 
tems, see [Miell, GMll]. In fact, it turns out that many reaction systems (even with 
nonlinear reactions of mass-action type) can be written as a gradient system with a well- 
chosen dissipation functional i/jq . While for linear reaction systems such as here (or more 
generally Markov systems) there are several variational gradient structures, this new for- 
mulation appears to be the only one that is compatible with diffusion processes. 

• A final comment should be devoted to the connections with stochastic particle systems, 
which we will explain in more detail in Section 7. 



1.10. Structure of the paper. In Section 2 we introduce a rescaling of space that desingularizes 
one of the terms in J^. This rescaling allows us to prove, in Section 3, the compactness of a 

sequence with bounded initial energy fe(p(0)) and bounded J^e{Ps) in a- number of topologies. 
Sections 4 and 6 give the two parts of the Gamma- convergence result, the lower bound and the 
recovery sequence. Before constructing the recovery sequence we investigate in Section 5 the 
function M in some detail. These are the central mathematical results of the paper. 

In Section 7 we place the results of this paper in the context of large-deviation principles for 
systems of Brownian particles, and comment on the various connections. In Section 8 we discuss 
various aspects of the results and their proof and comment on possible generalizations. Finally, 
in Section 9 we draw parallels between this work and an independent study of the same question 
by Herrmann and Niethammer [HNll]. 



Summary of notation 

— ^ weak convergence in duality with continuous functions 

CE(-, ■; ■) pairs (p, v) satisfying the continuity equation (1-21) 

■) Wasscrstcin distance of order 2 (1-22) 

£e, Je, and Ae (Section 1) general energy, dissipation fuctional, and action 

£s, Js, and Ae (Sections 2-9) Wasserstein energy, dissipation fuctional, and action, 

i.e the same as J^ass^ j^w^ss {1.2Q), (1.24), (1.25) 

£e, Je, and Ae £e, Je, and Ae written in terms of p (2-8), (2.12), (2.13) 

£o, Jo, and Ao limit energy, dissipation, and action (1-26), (1.29), (1.32) 

7e, 7e invariant measure (7e) and its push-forward under Se (1-2) and (2.5) 

Qe, Qe Lebesgue densities of 7e and 7e (2.1), (2.6) 

H 'enthalpy function', potential for the Brownian particle page 3 

k = l/K reaction parameter (1-5) 

M(-,-) argument of the integral in J7o (1.30) 

Se transform from ^ to s, inverse of ^e (2-3) 

Te time rescaling (2.2) 

Me density dpe/d'je (1-6) 

Me transform of Me, Me = Me o ^e (2-7) 

We Lebesgue density of VePe (2-15) 

We Lebesgue density of Vepe (2-10) 

^e transform from s to ^, inverse of Se (2.3) 

Zg normalization constant of 7e (1-2) 
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2. Rescaling 

2.1. Definitions. Prom here on we write £e, Je, and Ae for ff'^^, jT^^ass^ ^^j^^ ^^ass^ 

since we 

will only be using the Wasserstein framework. Since for most of the discussion the interval (a, b) 
will be fixed to (0, T), we will also write J{p) for J{p; 0, T) etcetera. 

A central step in the analysis of this paper is a rescaling of the domain which stretches the 
region around ^ = 0. This converts the functions u^, which have steep gradients around ^ = (see 
Figure 3), into functions iie of the new variable s that will have a more regular behaviour. 

We call Qe the Lebesgue density of 7^, namely 

.9,(0 := Z7lc-^(«)/^ and we set k:=\= VI-^^^WI ^^1) ^^^^^^ (2.I) 

We now make the choice of precise: 

An application of Watson's Lemma gives the asymptotic estimate 

Using that is even (since H is even), we introduce the smooth increasing diffeomorphism 

/•« 1 , _ 

Se : [-1, 1] [-K, k], SeiO := / 1—r dr], with inverse ■= s^^ ■ [-k, k] -> [-1, 1]. 

(2.3) 

Note that satisfies 

^Je{s)=T,g,iUs)), |e(-K) = -l, and |e(«) = 1- (2.4) 
With this change of variables we call S := [—k,k.] the domain of the variable s and we set 

% ■■= (Se)#7£, Pe ■■= {Se)#Pe- (2.5) 

Observe that the Lebesgue density of 7, satisfies 

UUO)^^UO = 9e{0 so that g,{s)=Tegl{ie{s)), (2.6) 
and the transformed measure pe satisfies 

Pe = Ue%, Ue:=UeO^^. (2.7) 

In particular, we can easily transport the entropy functional to the new setting 

£s{p):= u{s)logu{s)%{ds), so that £e{p) = £e{p) if p = {se)#p. 
Js 

If {p,pv) e CE(a, 6;S) then the couple {p,pv) with v{s) = v{^{s))/{Tsge{^e{s))) belongs to 
CE(a, b; S) and satisfies the continuity equation 

dtp + d,ipv) ^0 in ©'((0,r) X M); (2.9) 

in fact, since f (^(s)) = v{s)^'^{s), for every ^ = o ^ g C^((0, T) x [0, 1]) we have 

= r r (dt^ + V d^^) (do dt= r f ((a*.^) oi,+{vo i,) (d^^) o ptids) dt 

= (dt^ + t V {d^ct>) o i,) Ptids) dt = (dt4> + V d,^) pt{ds) dt. 

Setting w := V ii (je wc also have 

I/rhi&W = ll/Si^^s = m d. (2.10) 



(2.8) 
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Since 

we also get 



dsVii = {d^Vu ° 4) i'e = T£ {d^Vu o 4) o 4 



2t, 



9f ^/u 



d7e = 2 



! 1 

^ d7, - 2 



ds. 



Combining (2.9), (2.10), and (2.11), we now define the functional 



9 / -^ds + 2 



ds ) dt, u—-J^, w = uvqp, 
' ' d7e' 



and {p, pv) — {p,wL^) G CE(a, 6;S). This calculation shows that 
Je(p;0,T) = J,(p;0,r), and 

^.(P; 0, T) = A,{p; 0, T) = - Up{a)) + Xip; 0, T). 



(2.11) 



(2.12) 



(2.13) 





i slope 0(1) 



s = 



s = 1 



Figure 4. The transformations from ^ to s and from Ue(^) to Ue{s). The left- 
hand graph shows the bijection between ^ € [— l^l] and s e [0,1]. The right- 
hand graphs illustrate how the transformation (2.7) expands the region around 
^ = and converts the function with a near-discontinuity around ^ = into a 
function that has a slope of order 0(1). 



Remark 2.1. The desingularizing effect of the transformation from to can best be 
recognized in the last term in (2.12). In terms of u^, this term is the i7^-seminorm of ^/u^, 
and indeed boundedness of implies boundcdness of y^Ue in L^(0, T; i7^(S)) (sec the proof of 
Theorem 3.2). Compare this with the corresponding term in the non-transformed version (2.11), 
where the vanishing of t^^^ close to ^ = allows for large gradients at that point. 

As an independent way of viewing the effect of the transformation, the equation (1.7) for 
transforms into the equation 

Cje dtUe = dssUe 

for u^. Here the structure of the term d^iii- (specifically, the lack of singular terms inside the 
derivatives) is related to the better behaviour oi u^. □ 
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2.2. Formal analysis. The most complete understanding of the limit e — )• 0, and how becomes 

converted into Jq^ arises from considering the rescaled function u^. However, one can also gain 
some understanding from a formal analysis in the original variable ^, which we outline here. 
Since {ps,PeVe) € CE(0, T;S), we have 

gedtUe + d^{UeVege) = Q, (2.14) 

which can be partially solved to find 

MO ■■= U,{0Ve{09ei0 = j\ Ue9e. (2-15) 

Since vanishes in the interior of [—1,1], we expect that We becomes constant in ^ e (—1,1), 
with value 

Writing the first term in (1.24) in terms of w^, and using the convergence of to the constant w, 
we find that the two ^-integrals become 

7^ —d^+^ —9e « ^ + o / —re9ed^. 

2Te i_i «e5£ 2 J_i Ue 2 J _i T^g^ 2 J_i 

This expression is already close to the definition of the function M in (1.30). The remaining 
difference, the two factors T^gE in the two integrals, is removed by the transformation (2.3) from 
^ to s. 

One can go further, and reconstruct how the limit equation (1.4) arises in the ^-variable. For 
small e, the large value of implies that for fixed t the function ^ Me(^, t) is close to a stationary 
solution of (1.7) with Dirichlet boundary conditions at ^ = ±1. Writing this equation as 

gedtU^ = Te(95(ge55Ue), (2.17) 

it follows that (writing u± for the boundary values) 

where we have used the characterization (2.2) of r^. Comparing (2.17) with (2.14) we remark that 

SO that 

1 . (2.16) „ (2.18) U+ - U- (2.1) 1 , , , 

-U- = w ^ We = TegeO^Ue ~ = -k{u+ - U-), 

1 Zk 2, 

which coincides with (1.4a). 



3. Compactness 

The main results of this section. Theorems 3.1 and 3.2, describe compactness properties of 
sequences p^, and their transformed versions Pe, for which the initial energy ^^(^^(O)) = f£(pe(0)) 
and Je{Pe) = Jsipe) are bounded. 

Let us first comment on what one might expect. For pe and pe the limit objects are measures 
Pq and po concentrated in {—1,1} and {— k, «;}. The existence of converging subsequences is a 
simple consequence of the bounded total variation of the measures and the bounded domain of 
definition. However, this convergence alone does not contain enough information for the lower 
bound result of Theorem 4.1. 

The key to obtaining more detailed convergence statements lies in using the objects that appear 
in Je and Je, which are We ■= VeUege, We, d^Ue, and dgUe- Boundedness of Je{Pe) implies, using 
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the definitions (1.24) and (2.12), that there exists a constant C > such that 



Ts lo Is 



didt<C, Tel / ^7e(d0dt<C, (3.1) 



Ue9s Jo Js U, 

T 



dsdt<C, / / -^dsdt<C. (3.2) 



Me 

If the sequence iie also happens to be bounded in L°°, then the bounds (3.2) imply weak compact- 
ness of We and dgUe in i^((0, T) x S). Also, since re7e is unbounded in any set not containing ^ = 0, 
(3.1) suggests that should become constant in [—1, 0) and (0, 1]. In Theorem 3.1, where wc make 
this additional assumption of boundedness in we show that the remarks above indeed are true. 
Moreover, we shall see that we can recover the canonical decomposition po = \u~5-i + \u^5i by 
taking the limit of the traces of the densities at ^ = ±1, and similarly for p^. 

When Ue is not assumed to be bounded in L°° , singular behaviour is possible that violates 
the L°° bound but influences neither the boundedness of energy and dissipation nor the limit 
object pq. We treat this case in Theorem 3.2. 

Theorem 3.1 (Compactness under uniform L°° bounds). Let p^ = Ue'Js € C([0, T];Z™^'^) satisfy, 

for suitable constants m, C > and for all e > 0, the estimates 

Pe(t,E) = m for allte[0,T], £e{pe{0))<C, Je{Pe)<C, and ||ue||co < C. (3.3) 

Then there exists a subsequence (not relabeled) and a limit pq = Mo7o S C{[0,T]; Z™'^"''^) such that 
the following hold: 

(1) pS) ^ Po{t) in for every t € [0,T]; 

(2) The spatial traces Ue(-,±1) are well-defined and converge strongly to Uq{-) = uo{-,±l) in 
L\Q,T); 

(3) For all Q < 5 < 1 the function converges uniformly to the limiting trace values in 
Li(0,T;i°°(-l,-5)) andL^{Q,T-L'^{5,l)). 

Let lie ^6 transformed, sequence and let We be given as in (2.12). Then there exist limits 
Ho e L°°((0,T) X S)nL2(o,T;VKi'2(S)) and ivo G L^{{0,T) x S) such that 

(4) Ue uo in L°°((0,T) x S), and dgiie dgiio, We wq in L^((0,T) x S); 

(5) the traces ofuo{-, s) at s = ±k coincide with the traces ofuo, i.e. they satisfy = 

m (0,T); 

(6) Pe ^ Po{i) = \{uQ{t)5-^ + u+{t)5^) m 9^(5) for every t e [0,T]; 

(7) wo{t,-) is constant in S for a.e. t G (0,T) and satisfies wo{t,-) = ^iioit) a.e. in (0,T). 
The couple (poji^o), i'o = '"^o-^^|(o,t)xS satisfies the continuity equation 

dtPo + dsOo = in ©'((0, T) x M). (3.4) 

We will see in Theorem 5.4 that in the special case of solutions of (1.1), which satisfy As{pe) = 
rather than Ae{pe) < C, the limit object uq is a linear interpolation of the values at s = ±k. 

Proof. We divide the proof in a few steps; we will denote by C various constants which are 

independent of e. 

Step 1: Entropy estimates. There exists a constant C > such that 

£e{Pe{t)) = SeiPeit)) < C for cvcry t e [0,T]; (3.5) 
in particular, for any subsets A d (—1, 1) and A d (— k, k) we have 

lim sup pe{t, A) = 0, lim suppe(i. A) = 0. (3.6) 

It is sufficient to prove (3.5) for the unrescaled measures p(t). First we note that Ee is nonnegative; 
denoting by p(t) := p(T — t) the time-reversed curve, since Jeifi) = Je^p), the bounds (3.3) and 
(1.16) imply that Ee{Pe{t)) < C for every t e [0,T]. 
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Property (3.6) follows from (3.5) and the fact that lim^j^o 7e(^) = l™£4.o 7e(^) = 0- Considering 
e.g. the case of A d (—1, 1), by using first the inequality r log r > — e~^ and then Jensen inequality, 
we get for every A (e (—1, 1) 

-j,{{-l,l)\A)+£Mt))+mlogm > [ u,{t,OlogUe{t,Ole{d^) > Pe{t,A)log (^4^)' 
e Ja ^ l£(A) ) 

(3.7) 

so that 7e(^) — >■ implies supj pe(t, A) — >• as £ — 0. 

Step 2: Estimates on u^,Ws. There exists a constant C > such that 

||we||oo<C' and / [ wS^s^ dsdt < C. (3.8) 
^0 is 

The first bound derives from assumption (3.3), and the second follows easily from (3.2) and the 
L^-bound on ii^. Wc state these here to contrast with the corresponding, weaker, versions in the 
proof of Theorem 3.2. 

Step 3: Pointwise weak convergence of Pe{t) (statement 6): there exists a sequence 
£n iO and a limit po{t) <^ % such that Pe„{t) — ^ Po{t) for every t e [0,T]. 

Starting from the continuity equation we have for every <to <t-i <T and € C^([0, 1]), 

/ (pdps{ti)- [ (pdpe{to)= [ [ dsipwgdsdt. 
Js Js Jto Js 

Recalling the definition of the L^-Wasserstein distance [AGS05], we find 

dwAPe{ti),Pe{to)) :=sup|^^dpe(ii)-^<^dpe(to) : veCi([0,l]), \ds<fi\ < l| 

[ I \we{t,s)\dsdt< ^2K{ti-to)( I [we{t,sfdsdt 

Jto Js \Jo Js 

It follows by (3.8) that the curves t h- > p^j arc an cquicontimious family of mappings from [0,r] 
into the space !M{S) endowed with the L^-Wasserstein distance; since the total mass is m, the 
claim follows by the Arzela-Ascoli theorem. 

Step 4. Weak convergence of Pe(t) (statement 1). Writing the limit po of the previous 
step as po{t) = gUg (t)<5-K; + 5UQ'(t)5re, we have for every t G [0,T], 

Pejt) - po{t) = W-^-i + (3-9) 

Let us fix t e [0, T] and let us consider a subsequence of £„ along which (i) converges weakly 
to some p G fAf(S). By the result of Step 1, we know that p = ^u~5-i + for some u^: if we 

show that = u^{t) we have proved the thesis. Considering u~ and taking a function (p € C(S), 
we know that ips := (p o s^ converges pointwise to </'o(?) = 'P ° sq, where so(^) = sign(^) (with 
so(0) = 0) and the convergence is uniform on compact subsets of [— 1, 1] \ {0}. It then follows that 

\y^utm{±K) = lim / 0{s) p,.^{t,ds) = lim / fe'M Pe'St^^i) = I ^o(Op(dO 

Since (p is arbitrary, we obtain (3.9). 

This shows the intuitive result that the densities of uq and uq (with respect to 70 and 70) are 
the same; we call them both Uq . It mirrors the fact that the traces of (in ^ = ±1) and of 
(in s = ±k) are also the same. 

Step 5. Convergence of (statements 2 and 3): The traces uf^ = Us^{ - ,±1) strongly 
converge in L^{0, T) to the limits Uq defined in the previous step. In addition, setting u>f := ±{6, 1) 



< 



1/2 
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we have 



fT 

lim / sup |Me„ (t, - {t) \dt = for every < ^ < 1. (3.10) 



56w 

Let us first observe that the quantities 

■■= ^"'ff,^ = / «e„(i,0d7e(0, < ^ < 1, 

7eK ) ) = 

are uniformly bounded and converge pointwise to u^(t) for every t G (0, T) by Step 4. By (3.1) 
and the boundedness of we have 



lim/ |0f |2(t)dt = 0, where ef{t):= sup |«e(i,0 " "e(i,»?)| < / lajM^I' d$) 
We then calculate for ^, rj e u)f, 



\Mt,^)-uf^,sit)\<^^, [ \u,{t,0-u,{t,rj)\j,{dv)< 

7e(w5 ) J^f 



lim / sup \ue„{^,t)-uf s{t)\dt = 



) -/a;? 7e(W5 ) 

Recalling that 7^(0;^) 1/2 as e — >■ 0, we thus obtain 

rT 

which in particular yields (3.10) and the convergence of the traces, since uf^ g strongly converge 
to Uq in every L^(0,T), p < +00. 

Step 6. Compactness and limits (statement 4). Given the estimates (3.8) and (3.2), this 
follows from standard results. 

Step 7. Identification of the traces of uq (statement 5). Since the trace operator Tr is 

weakly continuous from H^(S) to L^({— k, k}) Ci: R^, the weak convergence of Ug^ in H^{S) implies 
that its traces Us^{-,±k) — Ue„(-,±1) converge weakly in L^(0,r;]R^) to Truo- Since «£„(•, ±1) 
converges strongly to Uq in L^{0,T) (Step 5), it follows that Ttuq = Mq . 

Step 8: The continuity equation and the structure of wq (statement 7). Passing to 

the limit in the continuity equation (2.9) and using the previous convergence result we immediately 
find (3.4). Since po is supported in [0,T] x {—«,«}, we obtain that dsV^ = in [0,T] x (— k, k), 
so that Wo depends only on t. 

Choosing a test function of the form Lp[t,s) = i\j{t)C,[s) with V e C~(0,T) and C S C^(M), 
^ = 1 on a neighborhood of k and ^ = on (—00, 0], we obtain 

\ [ i>{t)u+{t)dt= [ [ ij{t)C{s)po{t,ds)dt = - [ [ i>{t)C{s)wo{t)dsdt. 
^ Jo Jo Jm. Jo Js 

Since Jg('{s)ds = ({k) = 1, we conclude that Wq is the distributional derivative of ^Uq ■ This 
concludes the proof of Theorem 3.1. □ 



We now discuss the case where is not assumed to be bounded in L°°. A simple example 
shows how tie may become singular without affecting any of the relevant limit processes. Take any 

sequence with bounded ^^(^^(O)) and J'^{p^), and let Ug = dpe/dje be bounded from above 
and away from zero as well. Fix two nonnegative functions (f G Cg((— k, k)) and tp G C^{R), fix 
<to <T, and define 

pe{t,s) := Pe{t,s) + -^v(^-^)¥'(s)7e(s)- 

Note that since the additional term blows up polynomially, while % converges to zero exponentially 
fast on supptp, the limits of p^ and p^ are the same. For the same reason the perturbed Wg, 
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satisfying dfp^ + dgW^ = 0, onfy differs from by an exponentially small amount. Therefore 

f f 
lim sup / / ^ dsdt = lim sup / / ^ dsdt < oo. 
£^■0 Jo Js ^£ s->-o Jo Js 



We also estimate 

Jo Js '^e £ Jo V e / 7s 



The first term of this estimate is bounded by assumption, and the second is bounded by the scaling 
in £ and the assumption that Wg is uniformly bounded away from zero. 

This example shows that the assumptions of bounded initial energy and bounded J do not rule 
out singular behaviour of the sequence between —k and k. The example also suggests what 
form this singular behaviour might take: that of a singular measure in time (called A""- below), 
but with bounded total variation. This is exactly what we prove in the following theorem. 

Theorem 3.2 (Compactness, the general case). Let = Ugje € C([0, T]; ^'"™*) satisfy, for 
suitable constants m, C > and for all e > 0, the estimates 

p,{t,E) = m for all t e [0,T], £s{ps(a)) < C, and J^{p,) < C. 

Then there exists a subsequence (not relabeled) and a limit po = mqIo € C{[0,T];Z'^''°'^) such that 
the following hold: 

(1) pS) ^ Po{t) in fW(S) for every t € [0,T]; 

(2) The spatial traces Ue(-,±1) are well-defined and converge strongly to (•) = 'Uo(',±l) in 
L\0,T); 

(3) For allQ < 5 <1 the function converges uniformly to the limiting trace values in 
L^{0,T; L°°i-1, -5)) and L^{0,T; L°°{5, 1)). 

Let Ue be the transformed sequence and let We be given as in (2.12). Then there exist limit 
functions uq S L^{0,T;W^'^{—k,k)), wq € L^{{0,T) x S), a reference singular measure A"*" € 
5W"([0,r]) with A-L _L L^, and a function ttiq £ L^_,([0,T] x S) with d^rho G Ll^{[0,T] x S), where 
A-L = A-L O X^ls e fW([0,T] X S), such that 

(4) Us uo + rhoA-'- , dgUe dsUo + ds'rhoA.'^ , andw^ wq in the duality with C([0,T] x 

S); 

(5) the traces Uq ofuo{-,s) at s = ±k coincide with the traces of uq, i.e. they satisfy = u^ 
a.e. in (0, T); the traces rn^{t) ofrho{t,-) vanish for X^-a.e. t e [0, T]; 

(6) Ps ^ Po{t) = i(Mo W-^-K + u+it)S^) in fW(S) for every t G [0,T]; 

(7) U!o{t,-) is constant in S for a.e. t G (0,T) and satisfies wo{t,-) = ^u'^{t) a.e. in (0, T). 
The couple [poii^o), uq = u>o-^^|(o,t)xS satisfies the continuity equation 

dtpo + ds£'o = m !Z)'((0,T) X M). (3.11) 

Remark 3.3. Parts 1-3, 6, and 7 are the same as in Theorem 3.1. The main difference lies in 
the structure of the limits of Us and Ws (part 4) and therefore the identification of the traces of 
uo and mo (part 5). □ 

Proof. Some of the steps are the same as in the case of Theorem 3.1; for those we only give the 
statement. For the others we detail the differences. 

Step 1: Entropy estimates. There exists a constant C > such that 
£e{pS)) = £e{pS)) < C for every t e [0,T]; 
in particular, for any subsets A g (—1, 1) and A g (— k, k) we have 

hm sup pe{t, A) = 0, lim sup Ps{t, A) = 0. 

£->0 t s->-0 t 
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Step 2: Estimates on Us,We. There exists a constant C > such that 

[ sup\u^{t,s)\dt<C, [ [ \w,{t,s)\dsdt<C. (3.12) 
Jo ses Jo Js 

These bounds are weaker than the L°°-bound on Ug and the i^-bound on li^ of Theorem 3.1. Let 
us set := ^/u^: since € i^(0, T; W^-^{—k, k)) its traces at .s = ±k are well defined and belong 
to L^{0,T). We set 9e{t) ■= sup^^^gg \pe{t,r) —Pe{t,s)\. Standard estimates yield 

i-T 



Moreover 



and 



and therefore 



0s{t) < (2k j^\dsPe?dsy\ ^ el{t)dt < Je{Pe;0,T) < C. 

< I pI d7£ = m, 
Js 

Pe{t, s)< peir) d%{r) + ^e(i) < \/m + 6^) for every s e S, 
Js 



{lp.d%) 



sup Ue{t,s) < 2m + 2e;{t). 
The second estimate of (3.12) then follows from 



Mt,s)\ds<( / ^^^£|i^ds) ( / Mt,s)ds) 



step 3: Pointwise weak convergence of Pe{t) (statement 6): there exists a sequence 
£n -l and a limit po{t) such that Pe^iit) Po{t) for every t G [0,T], and po{t) <C 7o. 

Step 4. Weak convergence of Pe{t) (statement 1). Writing the limit po of the previous 
step as po{t) = (f)(5_K + ^UQ{t)Sf^, we have for every t G [0,T], 

Pe„W ^ Po{t) = ^UQ{t)d_i + ^U+6i. 

Step 5. Strong convergence of traces (statements 2 and 3): the traces uf^ = u^^{ ■ , ±1) 

strongly converge in L^{0,T) to the limits uf defined in the previous step. In addition, setting 
Lof := ±{d, 1) we have 

T 



lim / sup |ue„(t, ^) — Ug (f)| dt = for every < 5 < 1. 
The proof of this step is similar to that of Theorem 3.2, but uses instead the estimate on p^ := -s/u^, 
Ikn r \0f\^t)dt = O, ef{t):= sup M^,t) - p,{v,t)\ < (6 [ \d^p,\' d^'^'. 

Step 6. Compactness and limits (statement 4). Because of the lack of an bound, 

from here on the proof differs significantly from that of Theorem 3.1. Let us set l^it) := 1 + 
supjjgg \us{t,s)\. Up to extracting a suitable subsequence £ — >■ (without changing notation) we 
can assume that there exist weak limits A e 5W([0, T]) and /xq, J^O) "TO) ^o € fW([0, T] x S), such that 
(identifying functions with the corresponding measures) 

4 A, Us ^ fio, Ws ^ uo, dsUe ^ ^0, and \dsUe\ ^ ^o- (3.13) 

Since Us{t,s) < 4(0 we have /to < A := A(g)X^|s, so that /to = moA for a suitable bounded Borel 
function too e L^([0,T] x S). Since 

^ ''dsdt<C, 



JS 
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it follows (see Lemma 3.5 below) that in the limit uq <C /io and \%\ < ito ^ Ao- In particular 

i>o = noA and "fo = .90 A with no,^o G L^([0,T] x S). Since 9s/io = COj we easily have for every 
couple of test functions V S C°°([0, T]), ^ e C^{-k, k), 

1 1 ^{tMs) go{t, s) ds X{dt) = - [[ i'iWis) mo{t, s) ds X{dt). 

JJ{0,T)xS JJ{0,T)xS 

Since is arbitrary, we deduce that dsirioit,-) = go{t,-) in i°°(S) for A-a.e. t G [0,T]. We also 
deduce that 

"'^/ (!^ + K)dsX{dt)<C. (3.14) 
Is vmo mo/ 

The measure A € f7W([0, T]) can be decomposed as A = £L'^ + X-^ with £ € ^^(0, T) and A-^ ± 
and similarly A = £L^ (Si L^\s + A-"- with A-"- = A""- (8) X/^|s. We set Uq '■= rno^ and := no£, 
so that the limits in (3.13) can be decomposed as /to = wo + moA-"-, ^o = dgUo + dgiio^'^, and 
uq = 11)0+ no A-"- . In Step 8 below we show that the last term, no A-"- , vanishes. 

Step 7. p,Q and Uq have equal traces (statement 5). Let us consider, e.g., the case of —k 
and take nonnegative test functions tp G C{[0,T]) with supig[o,T] IV'(OI — 1' and (f € C{[—k,k]) 
with support in [— K,0) and integral 1, so that $(s) := J^(p{r)dr is decreasing, supported in 

[— K, 0), and satisfies <!>(— k) = 1. Wc also set (ps{s) := S^^ip{—K + S^^{s + k)), $5(5) := $(— k + 
(5~^(s + k)), </?5(s) = — $^(s). Denoting by ft the product [0,T] x S, we have 



[ i){t)uo{t)dt- [[ iP{t)ips{s) Mdsdt) < [ \uo{t)-uj{t,-K)\dt 
Jo JJn Jo 



+ // tpit)(ps{s)\uj{t,-K)-Ueit,s)\dsdt + 



4>{t)(ps{s)ue(t, s) ds dt ■ 



II m 

JJn 



ips{s) fio{dsdt) 



Passing to the limit as £ — ^ the first and third terms vanish; concerning the second term, we 
have 



Iln 



ip{t)(ps{s)\ue{t,-K) - Us{t,s)\dsdt < jj ip{t)(ps{s) J \daUe{t, r)\ dr^ ds dt 

= ^V'(i)(y ^s{sMuS,s)\ds)dt. 
Combining these inequalities and passing to the limit as £ — > wc get 

/ i>{t)uo{t)dt- II ^p{t)Ms) Mdsdt) < II ^p{t)^s{s)ao{dsdt), 
Jo JJn JJn 

so that, applying Lcbcsguc's dominated convergence theorem with the fact that $5(s) — 
s > —K and < $5 < 1, we obtain 



for 



lim 
54.0 



/ ij{t)uo{t)dt - II ^{t)ips{s)fiQ{dsdt) < II V(i)<7o(dsdi) = 0, 

Jo JJn J J[o,t]x{-k} 



since cto <C A. 

On the other hand, recalling that /io = mo A and mo € L\{0,T;W^'^{S)), an analogous argu- 
ment yields for ifiQ {t) := mo{t, —k), 

I ^P{t)mo{t)X{dt)- II ^{t)ipsis) fioidsdt) < // i>{t)ipsis)\mo{t) - moit,s)\ds X{dt) 
Jo JJn JJn 

< II <^s{s)\go{t,s)\dsX{dt) ^ 0. 
J Jn 



Since ip is arbitrary, we conclude that 



mj A = u^L^. 



(3.15) 



Step 8: Passing to the limit in the continuity equation (statement 7). This step is 
the same as in the proof of Theorem 3.1. 
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Conclusion: Vanishing of the singular part of i>o, i.e. WqX-^ = 0. Prom (3.15) it follows 
that m^{t) = for A-^-a.e. t € [0,T]. On the other hand, (3.14) yields for A-^-a.e. t e [0,T] and 
for every rj > and s{t) with mQ{s{t),t) > 0, 



+0O > 



wo{t)'^ , go{t,s) 



+ 



\J -2Js\r] 



dsrho{t,sy 

2 Js^'rho{t,s) ' mo{t,s)J ~ 2 Js\r] + mo{t,s) ' r] + mo{t,s). 



+ 



ds 



>\wo{t)\ / \d,log(Tj + mo{t,s))\ds>2\wo{t)\ log?] - \og{Tj + mo{t, s{t))) 
Js 

Since > is arbitrary, we conclude that woit) = A-'--a.e. t e [0, T]. 



□ 



The Lemma below is similar to many other duality results (see e.g. [AFPOO, §2.6] or [AGS05, 
Lemma 9.4.4]) and seems to have some wider usefulness. We state it in M.'^ for generality. 



Lemma 3.4. Let ncR'^. For n € and v G f7W(17; 



dv 2 
d/u 



d/i = supM [adM + 6-dz/]: a e C6(f2), 6eC6(0;M'^), a+'^<0 \. (3.16) 



In particular, if the right-hand side is finite, then v <^ ji and ^ e i^(fi). 

Proof. Wc write J^(i^|/i) for the left-hand side, and J^'(i^|/i) for the right-hand side. We first show 
that < J^{i'\n). If v is not absolutely continuous with respect to /x, then J^(i/|/x) = cxd, 

and there is nothing to prove; if <C /i, then we can write v = ffi. For all a and b continuous, 
bounded, and satisfying a + |6|^/2 < 0, we have 



[adii + h-dv\= / [a + b- f]dii< 



\b? . \f\' 



djjL < 



\f\' 



dfi ~ T{y\ii). 



To prove the opposite inequality, we assume that T'{y\\i) < oo, and first show that <C /x. 
Suppose not; then there exists a Borel set ^ C f2 such that fi{A) = and ^{A) ^ 0. Take c > 0, 
set a — —cxa, and define a sequence a„ S Cf,(r2) such that a„ f a- Then J a„ d^ — )■ as n oo. 
On the other hand, setting 6„ := y/—2ani^{A)/\v{A)\, we have / 6„ • di/ ^ \/2c|i/(^)| > 0. Since 
c is arbitrary, this violates the finiteness of J"'(z^|/i), and therefore u <^ ^. 

Again writing i' = fn, with / € L'^{fj,y, we now choose 6„ G Cb(fi) such that 6„ — > / in L'^{fj,)'^, 
so that J bn- fdfi ^ J |/p d^ = 2J"(i/|/i). Setting a„ := — ]6„]^/2 we have a„ — )• — ]/]^/2 in L^{ij), 
and therefore / a„d/i — > —^"(1/1/1). The result follows. □ 

The above dual characterization (3.16) of the property G -^^(^) now be used to char- 
acterize the limits in Step 6 of the above proof. 



Lemma 3.5. // m„ — ^ jjL and Wn 



sup 

n Jq Ur, 



dx =: C < OO, 



then u 11 with G i^(0) and 



dv_ 
d/i 



dji < liminf 



n a„ 



(3.17) 



Proof. For each pair (a, 6) as in the right-hand side of (3.16) we have 

C > — - — dx > [aun + b ■ Wn] dx — >■ / [adiJ. + b • di^] . 

Thus, the hypothesis of Lemma 3.4 is satisfied and G L'j^i^) follows. 

Moreover, choosing a pair (a, b) in (3.16) that approximates the left-hand side in (3.17), we also 
obtain the desired estimate (3.17). □ 
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4. Lower bound 

Theorem 4.1 (Lower bound). Under the same conditions as in Theorems 3.1 or 3.2 let us assume, 
without loss of generality, that Pe{t) ^ Po{t) for every t G [0, T]. Then 

Jo(po) < liminf J^(pe) and £o{po{t)) <lirrimi£^{pe{t)) for every t e [0,T]. (4.1) 

Proof. The lower semicontinuity of the entropy functionals under weak convergence is well known, 
see e.g. [AGS05, Lemma 9.4.3] or [ASZ09, Lemma 6.2]. 

Turning to J^, we can suppose by Theorem 3.2 (which contains Theorem 3.1) that 

its Ao = rhoA, dgUe ^ = 5oA, and We />o = "^oA. 
Setting uq = rhoi as in the proof of Theorem 3.2, we also have got = dsiio- By (3.17) we then have 



1 



Js 



Vd/io' Vdun/ 



Vd/io 



d/to < liminf j;(p£). 



We now discard the singular part moA-"- of /to and again write iuo ■= not, by which we find 



•^0 



Js ^""0 



+ 



I) 



dsdt < 



JS 



dt>o 
d/to 



+ 



Uo dsdt < liminf J^ipe)- 



Recalling that the traces of uq a,t s = ±k coincide with Uq , and that u!o{t, s) = woit) is constant 



with respect to s with wq = hu'^, we see that for a.e. t the integrand in the left-hand side of the 



previous inequality satisfies 



M{woit),ut{t)) < - 



uoit,s) uoit,s) 



ds. 



This implies the lower bound on JT^ and concludes the proof of Theorem 4.1. 

5. The minimization problem defining M and interpolation 
The minimization problem defining M is 



□ 



M(w,u=^) := inf 



«H 2 ^ 



+ 



u'{sf 



u{s) u{s) 



ds : u{±k) 



(5.1) 



This minimization problem gives rise to a natural interpolation of the boundary values u , which 
we study in the following theorem. 

Theorem 5.1. Let u[w , u'^]{-) be the solution of the minimization problem M{w,u'^). Then the 
mapping 

{w, u^) H^- u[w, u^] 

is well-defined and continuous from K x (0,oo)^ into C^(S). The function {w,u^) i->- M{w,u'^) 
is convex, smooth away from u^ = 0, minimal at w = and u+ = u~ , and satisfies M{w,u'^) = 
M(m;,u=F). 

Ifu^G ^^([0, T]; [6, oo)) for some d > 0, then the function 



{t,s)^u[-u+{t),u^{t)\ (s) 

is an element o/C^([0,T] x S). 

Proof. By the transformation z = i/u we can rewrite the minimization problem (1.30) as 



(5.2) 



inf 



i 



2z{sy- 



+ 2z'{sf ds : z{±k) 



The corresponding stationarity equation is 

-z" _ ^ _ 
4^3 - 



z{±k) 



(5.3) 



which implies that any solution z is concave and therefore z > minVu^, or u > min'U='=. 
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Since > 0, the existence and uniqueness of the solution u of (5.1), or equivalently of the 

sohition z of (5.3), arc classical, and the continuity follows from classical results for the continuous 
dependence of the solutions of elliptic problems on parameters. Similarly, if is a function 
e C^([0,T])^ and bounded away from zero, then the solution u[u+(-)/2, «=*=(•)] is on its 
domain (note that one degree of differentiation in time is lost since w+ appears as a parameter in 

the equation for z). 

The symmetry and minimality properties of M are immediate. To prove the convexity of M, 
take {wi,uf) and {w2,uf) with M{wi,u^). M{w2,uf) < oo, A e [0, 1], and let Ui and U2 be the 
corresponding minimizers. Since (u,w) w'^/u is convex, it follows that 

(Aw2 + (1 - A)wi)^ . x^-^ 
8(Au2(s) + (l-A)ui(s)) - 8w2(s) ^ ^8ui(s)' 

with a similar inequality for the second term in (5.1). Since Au2 + (1 — A)ui is admissible for 
M(A«;2 + (1 - A)«;i, Au^ + (1 - A)ttf ), we then have 

M(Au;2 + (1 - A)u;i, Xuf + (1 - X)uf) < 

{XW2 + (1 - \)wif (Au'2 + (1 - X)u[f 



-8(Au2(s) + (1 - X)ui{s) 2{Xu2{s) + (1 - A)mi(s) 
= XM{w2,ut) + (1 - X)M{wi, uf). 



ds 



2 „/|'o\2-, 



ii(s) 2ui(s) 



This concludes the proof of Theorem 5.1. □ 

As indicated at the end of the introduction, we can find good lower and upper bounds on the 
integrand M, which are given in the following theorem. 

Theorem 5.2. For all > and all w we have the estimate 

«;(l0gM+ - logM") < M(w;U^) < ^Og^^ -^Og" (4^2^2 ^ / + _ ^-.2\ 

4:k(u+ — u ) 

where both inequalities are equalities if and only ifw = {w^ —u~)/2k. In this case the minimizeru 
in the definition (5.1) of M{w;u^) is the affine interpolation u{s) = {k + s)u'^ /2k+{k — s)u~ /2k. 

Remark 5.3. Note that the left-hand side of (5.4) can be interpreted as 2{£' {w^),w) (see 
Section 1.7), implying that M(w;u±) > 2{£' {u''^) , w) and therefore Jo(m'^) > for all □ 

Proof. We define the functional J{w; u) = ^ Jg ^{w^ + w'^) ds such that M is obtained by mini- 
mizing J{w;u) over all u satisfying the boundary conditions u{±k) = u^. 
The lower estimate follows by neglecting the nonnegative term in 



)) 



f 1 2 f u' 

J(w;u) = / — (w — u')+w — > w(\ogu(K) — log u(—K 
Js^u^ Js u 

and using the boundary conditions. We also see that equality holds if and only if u' = v/2, which 
implies v = 4(m— a). 

The upper estimate is obtained by testing with the affine function u{s) = {k + s)m+/2k + {k — 
s)u^ /2k. Obviously, the lower estimate and the upper estimate coincide for w = (u"*" — u~)/2k. 
Hence, the result is proved. □ 

The fact that optimality occurs at affine functions also gives a characterization of the limit uq 
of a sequence of solutions u^: 

Theorem 5.4. Let he a sequence of solutions of (1.1) such that £e{pe{^)) converges as e ^ 0. 
Then the assertions of Theorem 3.2 hold, and in addition uq is affine in s: 

^ j g ^ g 

for almost all t, s, uo(t, s) = — — uqU, k) H — - — u^it, —k). 

2k 2k 
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Proof. The transformed solutions satisfy the equation 

gsdtUs = dssUe- 

The density concentrates on to the boundary points s = ±k, implying that in the interior of 
the interval S the equation formally reduces to = dggUs- Using classical methods for partial 
differential equations one can convert this observation into a proof that the limit is affine for 
each t. 

Instead we prefer to stay within the realm of the gradient-flow structure. Since the are 
solutions, Aeipe) = 0; by Theorem 4.1 and the assumption of convergence of the initial energies 
£e{pe{^)), we have Aoipo) < 0. Since Ao satisfies condition (1.16), ^o(po) — 0. This implies 
that uq is a minimizer of M for almost all t, and by Theorem 5.2 it is therefore affine for almost 
all t. □ 



6. Recovery sequence 

Theorem 6.1 (Recovery sequence). Let G AC(0,T;R) he such that j7o(u*) < oo. Then there 
exists a sequence Ue € C-^([0,r]xS) such that Ue{-,±K) u"^ in L^{0,T), ^^(^^(O)) Soiu"^ {0)) , 
£e{ue{T)) £o{u^{T)), and X{ue) ^ Mu^)- 

Remark 6.2. By this result the sequence Us and its other forms u^, p^, and p^ converge in the 
different senses provided by Theorem 3.1. □ 

Proof. By a diagonal argument, and using the lower bound (4.1), it is sufficient to prove the 
following approximation result: given ^ > 0, there exists a sequence (uf )e>o with G ^"^([0, T] x 

S) and 

limsup maxj (•, ±«) - m±|Ui(o,t)^, |4(«f (T)) - fo(i^^(T))|, |4(«f (0)) - fo(w±(0))| } < S, 

(6.1) 

and 

limsup J, (uf) - Jo{u^) < 5. (6.2) 

£-)-0 

We now prove this approximation result in several steps. First note that e W^'^{0,T), and 
that the finiteness of J7o(m^) implies that u+ + u~ is constant in time (say 2m) and therefore 
< < 2m and m+ = —ii". To simplify we only specify the value u" at —k, and consider the 
corresponding value u+ at +k as defined by the condition of constant mass. 

We first approximate u~ by a function that is bounded away from zero and from 2m. We do 
this by setting y~ := m + {l — r]){u'~ —m), for some small 77; as ry — )■ 0, y~ — >■ u~ in W^-'^'^(0, T). The 
function y~ is bounded away from and 2m; the convexity of M and the fact that it vanishes when 
w = and u'^ = u~ (Theorem 5.1) imply that for almost all t, M{y^ {t) /2;y^{t)) is decreasing 
in r/, and that M{y+ {t)/2;y^{t)) t M(u+(t)/2; ?i±(i)) as r/ 4, 0. This implies that 

^ M{^yr,{t)+;y^it))dt~^ M(^«+(t);u±(i)) dt as 77^0. 

Similarly £o{y^{T)) £o{u^{T)) and £o{y^{0)) £o{u^{0)), implying that for given ^ > we 
may choose t] > such that 

max{ \\y^ -u^\\mo,Tr, \£oiy^{T)) - £oiu^iT))\, 

\£o{y^{0))-£o{u^m\, iMvt) - Mu^)\ } < I- (6-3) 

We fix this number rj. 

The next step is to smoothen y^. We approximate j/^ in W^'^{0,T) by convolution to give a 
y G C^([0, T]), while preserving the pointwise upper and lower bounds. Because M is convex, it 
follows that 

My^) < My^), (6.4) 
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and we can choose y such that 

S 



max 



{ \\y^-y^\\LHo,Tr, \So{y^{T)) - £o{y^{T))l \£o(y^m-£o{y^ml } < -. (6.5) 

We now interpolate y by Theorem 5.1 without changing notation; note that then y G ([0, T] x 
S). Since y is fixed and C^, it follows that as — > 0, the corresponding function w^, defined by 
dt{y^s) + dsws = 0, is uniformly bounded and satisfies 



Vse [-«,«), Ws{t,s) = J y{t,a)%{da) ^ -y{t,- 



Therefore 

/-T p ,„2 



f f w-' f f 1 ■- 

i i = ,/„ is T = 2? W' 

and we have for sufficiently small e > that 

\X{y)-My^)\<\. (6.6) 

Similarly, since 7e ^ 7o, we have for sufficiently small £ > that 

max{ %{y{T))-eo{y^{T)% |4(y(0)) - £0(^^(0))! } < \. (6.7) 

The final step is to approximate y by a function of the right mass. Since y G C^([0, T] x S), the 
mass discrepancy 

m{t):= / y{t,s)%{ds)- / y{t, s) 'joids) 
Js Js 
converges to zero uniformly on [0,T]. Setting 

ut{t,s) := y{t,s) -m{t), 

we find that for sufficiently small e 

max{ ||wf(-,±K)-^||ii(o,T)=, \i,{4{T)) - i,iyiT))\, 

6 



\£Mm-£eiyml \Jei4)-Jeiy)\ } < ^- (6-8) 



The claims (6.1) and (6.2) then follow from combining the estimates (6.3), (6.4), (6.5), (6.6), (6.7), 
and (6.8). □ 

7. Connections with stochastic particle systems 

The mathematical results of this paper make important use of Definition 1.1 of a gradient-fiow 
solution. This formulation is more than a mathematical convenience: it arises naturally when 
considering equation (1.1) as the deterministic limit of a stochastic system of particles. We now 
explain this connection and its consequences. 

Fix £ for the moment. Consider a collection of n independent particles, each of which performs 
a Brownian motion in a potential landscape given by the energy function H/e. Equation (1.1) 
is the continuum or hydrodynamic limit of this system of particles, as the number n of particles 
tends to infinity. One way of describing this limit is by considering the empirical measure 



1 " 

Ln:[0,T]^9^{[-l,l]), L„(t):=-^% 



1=1 



where ^i{t) is the position in [—1, 1] of particle i at time t. As n — >■ oo, with probability 1 this 
empirical measure converges weakly to a limit measure p{t) at every time t, and this limit measure 
solves the equation (1.1).^ 



^We deliberately disregard the role of initial conditions for the moment; one could, for instance, choose a single 
delta function Sa as the initial datum for (1.1), and have all particles start at ^ = a. 
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Given this connection, a large- deviation result characterizes the probabihty of finding the em- 
pirical measure Ln{t) far from the solution p of (1.1). Such a result roughly takes the form 

P„(L„ « p) - exp(-n/(/5)), (7.1) 

in terms of a rate functional I. A rigorous version of this statement can be found, for instance, 
in [FK06, Th. 13.37]. 

The surprising feature, however, is that for this system of particles, the rate functional / above 
is exactly equal to the functional Ae in (1.25) (see e.g. [KO90] or [FK06, Th. 13.37]). This feature 
has several consequences. 

(1) Since Ae = I, the large-deviation result (7.1) gives an alternative explanation why Ae >0 

and why Ae = implies a solution of the deterministic system. The positivity of /, and 
therefore of Ae, arises directly from the property (7.1) and the fact that probabilities 
are bounded by 1; and since the hydrodynamic limit is assumed with probability 1, the 
solution p of the limit equation (1.1) necessarily satisfies Ae{p) = I{p) = 0.^ 

(2) In Section 1.2 we mentioned that there exist at least two different gradient-flow structures 
for equation (1.1). The fact that one of these structures arises in the large-deviation 
description of this stochastic system, may be interpreted to signify that this gradient- 
flow structure is more 'natural' — at least when we view (1.1) as arising from this specific 
stochastic particle system. Of course, there may well be a different stochastic system 
whose large-deviation behaviour is related to the structure (1.18), and there may be other 
arguments that favour other structures. 

(3) This connection provides an answer to the question, often heard, 'why does the Wasser- 
stcin metric figure in this gradient-flow structure?', since the Wasserstein dissipation arises 
directly from the large-deviation behaviour. However, a complete answer requires describ- 
ing the large-deviation result in some detail, which would take us too far; see [ADPZIO] 
for a detailed discussion. 

(4) In the context of a large-deviation result, it is natural to consider sets of the form {p : 
I{p) < <5} for (5 > 0; these correspond to collections of 'least unlikely' states, in the sense 
that their probability vanishes no faster than e""''. Again, this ties in with the results 
proved above, in which we do not assume Ae = 0, but only boundedness of £e{pe{Q)) 
and JeiPe)- 

The connection with large-deviation principles also explains the structure of M in (1.30). The 
well-known contraction principle describes how rate functions transform under projection, i.e. 
under loss of information. Suppose that 7 is a rate function describing the behaviour of a sequence 
of probability measures P„ on a space X, in the sense of (7.1). Let p : X ^ Y he a continuous 
map, and Qn •= Pn°P~^ the corresponding probability measures describing the behaviour of the 
system after projection under p onto Y. Then Q„ satisfies the large-deviation principle [dHOO, 
Th. III.20] 

Qn{y) ^ c^vi-nlply)) with Ipiv) ■= inf /(x). 

xGX:p{x)—y 

The form of the ftmction M can be imdcrstood in terms of this contraction principle. In the limit 
e = 0, the only information about Ue or iie that survives are the boundary values. Consequently 
the large-deviation behaviour of the system in the limit follows from the contraction principle by 
interpolating between the boundary values, in such a way as to minimize the functional over all 
missing information. The function M is the direct consequence of this. 

8. Discussion 

Passing to the limit in gradient flows. The aim of this work is to explore the potential of the 
Wasserstein gradient-flow structure of (1.1) for rigorous passing to the limit. For this specific 
system, we have succeeded to a large degree, and we comment below on the specific assumptions 
that we have made. 



■^A careful treatment of this argument actually requires a more precise definition of (7.1) and a discussion of 
topology; we omit both. 
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The property >t = is a reformulation of the concept of a curve of maximal slope, which was 
introduced by DcGiorgi and co-authors (sec e.g. [DGMT80]) as a metric-space generahzation of 
a gradient flow. Sandier, Serfaty, and Stefanelli [SS04, Ste08] appear to be the first to explore 
in detail the use of this structure for passing to the limit. Serfaty [Ser09] discusses the case of 
metric spaces, with obvious applications for the case of the Wasserstcin metric. She leaves aside 
the question of compactness, however, and one of the main contributions of this paper is to show 
that appropriate compactness 'in time' can also be obtained from the Wasserstein structure. 

A related result is that of Ambrosio, Savare, and Zambotti [ASZ09], who study entropy- 
Wasserstein gradient flows in a Hilbert space with respect to a weakly converging sequence of 
reference measures. Their approach first proves convergence of time-discrete approximations for 
fixed time, and then uses error bounds to prove convergence of the time-continuous solutions. 

Assvmptions. The assumptions in the main theorems arc the boundedness of the initial energy 
and of the dissipation function Jg, both of which are natural objects in the Wasserstein gradient 
fiow. The relaxation of the condition >te = to the condition sup- < oo is a broadening of 
scope: it implies that the compactness result holds not only for solutions, with their accompanying 
higher-regularity properties, but for a much wider class of sequences. In addition, this class 
arises naturally in the context of large deviations for an underlying stochastic particle system (see 
Section 7). 

However, a central tool is the mapping ^ i— >■ s, which desingularizes the diffusion term and allows 
for a more detailed study of the limit behaviour. This mapping is very specific for this problem, 
and it is an interesting question how to generalize it to singular systems described by different 
PDEs (e.g. higher-order [Ott98, GOOl, Gla03] or nonlocal parabolic equations [CMV03, CMV06]) 
or more complicated geometric spatial structure. 

Weak formulations and com,pactness. A related question arose during the work presented here: 
can our definition of a gradient-fiow solution, Definition 1.1, be viewed as a weak form of the 
gradient-flow equation (1.8), 

z = -VcSiz)-! (1.8) 
The straightforward answer to this question seems to be negative, since traditionally weak for- 
mulations serve to reduce regularity requirements, and in both cases the function z is necessarily 
differentiable. Therefore shifting from (1.8) to Definition 1.1 brings no advantage on that front. 

However, we argue here that a different aspect is just as important: the compactness and 
convergence properties of the formulation. As we have shown, solutions of a sequence of problems 
are compact in an appropriate way, and a subsequence converges to a limiting object that can 
be considered an appropriate generalization of a gradient flow (see Definition 1.1, and also the 
discussion in Section 7). 

This ties in with the strongly related work of Herrmann and Niethammer [HNll] , that we discuss 
separately below. One aspect that this paper and [HNll] have in common is the reformulation of 
a nonlinear, singular differential equation as a parameter-dependent variational problem, thereby 
opening the door to methods of variational calculus. 

Choice of convergence. We prove Gamma-convergence of the sequence of functionals J^. If 

one is only interested in convergence of solutions, then this is actually too strong: it suffices to 
prove the lower bound inequality. Theorem 4.1. We prove the recovery sequence. Theorem 6.1 
nonetheless, especially since it completes the picture of the convergence on J^. 

Incidentally, the fact that Gamma-convergence is a natural form of convergence for large- 
deviation rate functionals (see Section 7) has also been recognized in the probabilistic litera- 
ture [Leo07]. 

The micro-problem. The transformation to the new spatial variable s has the effect of blowing 
up the region in which the derivative of is large. The resulting function is more regular, 
as reflected by the i/^-bound on y/ii^ (see Figure 4 and Remark 2.1). This blow-up argument 
is reminiscent of the 'cell problem' in homogenization [Hor97] or the 'inner' and 'outer layers' 
in singular perturbation theory [Ver05]. Similarly, the parallel convergence results of those two 
theorems reflect these separate behaviours at two different scales. 
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It is interesting to note, however, that for the lower-bound inequahty one does not need to 
know much about the function M; actually, only its definition. Some additional information 
(the inequalities of Theorem 5.2) is necessary to identify solutions of = as solutions of a 
corresponding differential equation. Other additional information (Theorem 5.1) is necessary for 
the recovery sequence, Theorem 6.1. 



This is not the first paper to give an answer to the question that was raised in [PSVIO], Can 
we prove convergence using the Wasserstein gradient flow? In [HNll], Herrmann and Nietham- 
mer give a different (but again affirmative) answer. Here we briefly describe their approach and 
comment on the differences. 

The authors of [HNll] build upon a solution concept for gradient flows based on an integrated 
form of the Rayleigh principle. This concept has been used before in nearly-flnite-dimensional 
situations [NOOl, NOlO], but its application in a truly parabolic context appears to be new. We 
describe it here in the case of a linear space Z; the generalization to a manifold is straightforward. 
Given an energy functional £ on Z and a Riemannian metric G (see the introduction), it is a 
straightforward observation that if 2: is a solution of the gradient-flow equation (1.8), then its time 
derivative z{t) at time t is a minimizer of 



Inspired by this, the authors of [HNll] define an integrated Rayleigh principle as follows: an abso- 
lutely continuous function z : [0,T] ^ Z satisfies this principle if its time derivative z minimizes 
the functional 



among all v : [0, T] Z. 

In [HNll], the authors first remark that at finite e > 0, the solution of (1.7) is a minimizer 
of this integrated Rayleigh principle. In addition, the a priori estimates of [PSVIO] provide 
appropriate compactness of the sequence Ug. The central result is then that the integrated Rayleigh 
principle for the limiting function uq can be derived from the same principle for the solutions u^. 

The work by Herrmann and Niethammer is interesting for various reasons. First, this solution 
concept merits to be considered more closely, and we make some comments on this below. Next, the 
authors themselves state as a drawback that the compactness that they use docs not derive from 
the Wasserstein gradient-flow structure, but from the linear semi-group structure used in [PSVIO] . 
They pose in turn the question whether the compactness can be derived from the Wasserstein 
structure. And flnally, what is exactly the relationship between the solution concepts of [HNll] 
and of this paper, and similarly of the convergence theorems of the two papers? 

There is a problem with the definition of the 'integrated Rayleigh principle' for a general 
function z. Take the example of a Hilbert space H and a continuous semigroup generated by a 
non-negative self-adjoint operator A, which solves the equation z = Az in H. This is a gradient 
flow with {z,Gy) = iz,y)H and £{z) = \{z,Az)h- If z{t) ^ D{A^/'^) at f > 0, then £{z{t)) = 00 
and D£{z{t)) is not well defined. Another way of stating this is that the right-hand side of (9.1) 
is not bounded from below, and its infimum equals —00. This in turn implies that even though z 
might minimize (9.2) for fixed z{-), in the neighbourhood of that z there exist perturbations z, 
arbitrary close to z, for which the infimum equals —00. Therefore the formulation (9.2) is very 
unstable under perturbations of z. 



9. Comparison with a paper by Herrmann and Niethammer [HNll] 



V ^ {v,G{z{t))v) + {D£{z{t)),v). 



(9.1) 




(9.2) 
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It is no coincidence that the expression (9.2) is closely related to the functional A. We can 
write 

A{z) = £iziT))-£iz{0)) + £^^{z,G{z)z}+^-{-D£iz),G{z)-\-D£{^^^^ 

= £ (^^{z, G{z)z) + {D£{z), i) j dt+^ £(^-D£iz), G{z)-\-D£{z))) dt. (9.3) 

This form shows that Definition 1.1 with the structure (1.11) is different from the integrated 
Rayleigli principle above in two ways: first, the integral (l)£(z(t)), £(1)") dt has been converted 
into the end point values £{z{T)) — £{z(Q)), and secondly, the addition of the dual dissipation 
potential ip*{—D£(z)) (the second term in (9.3)) penalizes 'non-regular' values of z{t). 

Both of these changes appear to improve the robustness of the formulation. The addition of 
the dual potential has the effect of penalizing 'unfavorable' choices for the function z; and the 
conversion of the cross term into end point values mitigates the effect of fast oscillations. The 
compactness results of Theorems 3.1 and 3.2 certainly suggest that Definition 1.1 and (1.11) 
provide a useful basis for the analysis of these more general gradient flows. 
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